Equivalence , Containment and Rewriting of Aggregate Queries
نویسنده
چکیده
The primary goal of this thesis is to lay the theoretical foundations for a formal study of aggregate query optimization. This requires gaining a coherent understanding of equivalences and containments between aggregate queries of varied forms. A secondary goal of this thesis is to solve the view usability problem for varied types of aggregate queries. The view usability problem is that of determining whether it is possible to find a reformulation of a query that uses other queries, instead of base relations. This is of importance in query optimization, since it enables reuse of previously computed results. It is also a necessity when integrating information from heterogeneous sources, when each source is described as a view over a global schema. In order to achieve the primary goal, the general problem of determining equivalences among aggregate queries is considered. Our aggregate queries may contain disjunctions, comparisons and negated subgoals. First, the bounded equivalence problem, i.e., the problem of determining equivalence of queries with respect to databases whose size is bound by a constant, is studied. A complete characterization of when bounded equivalence is decidable is presented. It is shown how to determine equivalence by a reduction to local equivalence (a special case of bounded equivalence). Characterizations of equivalence for the special case of quasilinear queries are presented. Equivalence of quasilinear queries can be determined in polynomial time. Containment of aggregate queries is reduced to equivalence of aggregate queries. All the characterizations are couched in terms of abstract characterizations of aggregation functions. Thus, the results presented are easily extendible to additional aggregation functions. These results also solve additional open problems, including
منابع مشابه
Algorithms for Rewriting Aggregate Queries Using Views Algorithms for Rewriting Aggregate Queries Using Views Algorithms for Rewriting Aggregate Queries Using Views Algorithms for Rewriting Aggregate Queries Using Views
Queries involving aggregation are typical in database applications. One of the main ideas to optimize the execution of an aggregate query is to reuse results of previously answered queries. This leads to the problem of rewriting aggregate queries using views. Due to a lack of theory, algorithms for this problem were rather ad-hoc. They were sound, but were not proven to be complete. Recently we...
متن کاملAlgorithms for Rewriting Aggregate Queries Using Views
Queries involving aggregation are typical in database applications. One of the main ideas to optimize the execution of an aggregate query is to reuse results of previously answered queries. This leads to the problem of rewriting aggregate queries using views. Due to a lack of theory, algorithms for this problem were rather ad-hoc. They were sound, but were not proven to be complete. Recently we...
متن کاملAlgorithm for Term Linearizations of Aggregate Queries with Comparisons
We consider the problem of rewriting queries based exclusively on views. Both queries and views can contain aggregate functions and include arithmetic comparisons. To study the equivalence of a query with its rewriting query, the so called ”linearizations of a query” need to be computed. To find the linearizations of a query, the linearizations of terms from the query need to be generated. We p...
متن کاملEquivalence of Queries with Nested Aggregation
Query equivalence is a fundamental problem within database theory. The correctness of all forms of logical query rewriting—join minimization, view flattening, rewriting over materialized views, various semantic optimizations that exploit schema dependencies, federated query processing and other forms of data integration—requires proving that the final executed query is equivalent to the origina...
متن کاملThe complexity of higher-order queries
Higher-order transformations are ubiquitous within data management. In relational databases, higher-order queries appear in numerous aspects including query rewriting and query specification. This work investigates languages that combine higher-order transformations with ordinary relational database query languages. We study the two most basic computational problems associated with these query ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004